Fine-grain voice strength estimation from vowel spectral cues

Authors

  • Jean-Sylvain Liénard
  • Claude Barras
Abstract

This study investigates the possibility of recovering voice strength, i.e. the sound level produced by the speaker, from the recorded signal. The dataset consists of 720 isolated vowel tokens recorded while two interlocutors interacted orally at distances between 0.40 and 6 meters in a furnished room. For each token, voice strength is measured at the intensity peak, and several sets of acoustic cues are extracted from the signal spectrum after frequency weighting and intensity normalization. In the first phase, the tokens are grouped into categories of increasing voice strength. Discriminant analysis produces a classifier that takes into account all the signal dimensions implicitly coded in the set of cues. In the second phase, the cues of a new token are given to the classifier, which produces the token's distances to the groups, providing the basis for estimating the unknown voice strength. The quality of the process is evaluated either in self-consistency mode or by cross-validation, i.e. by comparing the estimate with the value initially measured on the same token. The statistical margin of error is quite low, on the order of 3 dB, depending on the sets of cues used.

Keywords: vocal effort, vocal intensity, voice quality, discriminant analysis
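The two-phase procedure in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the cues here are synthetic, the number of groups, the quantile grouping, and the distance-weighted averaging used to turn group distances into a dB estimate are all assumptions, and the helper names `fit_groups` and `estimate_strength` are hypothetical. The abstract does not say how distances are converted into an estimate, so the weighting below is only one plausible choice.

```python
import numpy as np

def fit_groups(cues, strength_db, n_groups=6):
    """Phase 1: group tokens by increasing voice strength and fit an
    LDA-style model (group centroids + pooled within-group covariance)."""
    # Quantile edges give groups of roughly equal size (an assumption).
    edges = np.quantile(strength_db, np.linspace(0, 1, n_groups + 1))
    labels = np.searchsorted(edges, strength_db, side="right") - 1
    labels = np.clip(labels, 0, n_groups - 1)
    centroids = np.array([cues[labels == g].mean(axis=0) for g in range(n_groups)])
    group_db = np.array([strength_db[labels == g].mean() for g in range(n_groups)])
    centered = cues - centroids[labels]
    pooled = centered.T @ centered / (len(cues) - n_groups)
    # A small ridge term keeps the inverse stable when cues are correlated.
    precision = np.linalg.inv(pooled + 1e-6 * np.eye(cues.shape[1]))
    return centroids, group_db, precision

def estimate_strength(cue_vec, centroids, group_db, precision):
    """Phase 2: squared Mahalanobis distances to each group, converted
    into a weighted average of the groups' mean strengths (assumed rule)."""
    diff = centroids - cue_vec
    d2 = np.einsum("gi,ij,gj->g", diff, precision, diff)
    weights = np.exp(-0.5 * (d2 - d2.min()))  # shift by the minimum for stability
    return float(weights @ group_db / weights.sum())

# Synthetic stand-in data: 720 tokens, 4 hypothetical cues that vary
# linearly with voice strength (values in dB are invented for the demo).
rng = np.random.default_rng(0)
strength_db = rng.uniform(50.0, 90.0, size=720)
slopes = np.array([0.10, -0.05, 0.08, 0.02])
cues = strength_db[:, None] * slopes + rng.normal(0.0, 0.1, size=(720, 4))

# Self-consistency mode: estimate each token with a model fitted on all tokens.
model = fit_groups(cues, strength_db)
estimates = np.array([estimate_strength(c, *model) for c in cues])
rmse = float(np.sqrt(np.mean((estimates - strength_db) ** 2)))
print(f"self-consistency RMS error: {rmse:.2f} dB")
```

For cross-validation, the same two functions would be fitted on a subset of tokens and evaluated on the held-out ones; the sketch leaves that loop out for brevity.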

Similar resources

Remote sensing of sediment characteristics by optimized echo-envelope matching.

A sediment geoacoustic parameter estimation technique is described which compares bottom returns, measured by a calibrated monostatic sonar oriented within 15 degrees of vertical and having a 10 degree-21 degree beamwidth, with an echo envelope model based on high-frequency (10-100 kHz) incoherent backscatter theory and sediment properties such as: mean grain size, strength, and exponent of the...


Why a phenomenology of vowel sounds is needed

In literature, there is an extensive and often controversial debate on the primary acoustic and perceptual cues of vowel quality, resulting in two main viewpoints that these cues are contained in either the formants, or, alternatively, in the spectral shape. However, in our understanding, one aspect is highly underestimated: the fact that any spectral representation of vowel-quality is directly...


Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion

A simple and fast voice conversion method based only on vowel information is proposed. The proposed method relies on empirical distribution of perceptual spectral distances between representative examples of each vowel segment extracted using TANDEM-STRAIGHT spectral envelope estimation procedure [1]. Mapping functions of vowel spectra are designed to preserve vowel space structure defined by t...


Spectral and temporal cues in cochlear implant speech perception.

OBJECTIVE Taking advantage of the flexibility in the number of stimulating electrodes and the stimulation rate in a modern cochlear implant, the present study evaluated relative contributions of spectral and temporal cues to cochlear implant speech perception. DESIGN Four experiments were conducted by using a Research Interface Box in five MED-EL COMBI 40+ cochlear implant users. Experiment 1...


Forensic voice comparison with monophthongal formant trajectories - a likelihood ratio-based discrimination of "schwa" vowel acoustics in a close social group of young Australian females

An experiment is described relating to estimation of strength of evidence in likelihood ratio-based forensic voice comparison. It is asked whether a better performance is obtained from point estimation of formant pattern targets in monophthongal vowel acoustics rather than formant trajectories. The hypothesis is tested on non-contemporaneous recordings of a custom-built challenging database of ...


Publication date: 2013